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Abstract. In this paper, we first introduce a lower bound technique for the state com- 
plexity of transformations of automata. Namely we suggest first considering the class of 
full automata in lower bound analysis, and later reducing the size of the large alphabet 
via alphabet substitutions. Then we apply such technique to the complementation of non- 
deterministic ^-automata, and obtain several lower bound results. Particularly, we prove 
an f2((0.76n) n ) lower bound for Biichi complementation, which also holds for almost ev- 
ery complementation or determinization transformation of nondeterministic cj-automata, 
and prove an optimal (Q(nk)) n lower bound for the complementation of generalized Biichi 
automata, which holds for Streett automata as well. 



The complementation problem of nondeterministic w-automata, i.e. nondeterministic 
automata over infinite words, has various applications in formal verification. For example 
in automata-theoretic model checking, in order to check whether a system represented by 
automaton A\ satisfies a property represented by automaton A2, one checks that the inter- 
section of Ai with an automaton that complements A2 is an automaton accepting the empty 
language [Kur94, VW94J. In such a process, several types of nondeterministic w-automata 
are concerned, including Biichi, generalized Biichi, Rabin, Streett etc., and the complexity 
of complementing these automata has caught great attention. 

The complementation of Biichi automata has been investigated for over forty years 
|Var07j . The first effect ive construction was given in |Buc62j . and the first exponential 
construction was given in |SVW85| with a 2°( n ) state blow-up (n is the number of states 
of the input automaton). Even better constructions with 2°( nlogra ) state blow-ups were 
given in [SaTMl IKlaQH IKVOlj . which match with Michel's n\ = 2 ni ~ nl °^ lower bound 
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|Mic88| . and were thus considered optimal. However, a closer look reveals that the blow-up 
of the construction in [KVOlj is (6n) n , while Michel's lower bound is only roughly (n/e) n = 
(0.36n) n , leaving a big exponential gap hiding in the asymptotic notation^. Motivated by 
this complexity gap, the construction in |KV01| was further refined in [FKV06J to (0.97n) n . 
On the other hand, Michel's lower bound was never improved. 

For generalized Buchi, Rabin and Streett automata, the best known constructions are 
in |KV05bl lKVu5a] . which are 2°( nlo s nfc ), 2°( nfclo s n ) and 2°( nfclo s nfc ) respectively. Here 
state blow-ups are measured in terms of both n and k, where k is the index of the input 
automaton. Optimality problems of these constructions have been vastly open, because only 
2^(niogn) i ower bounds were known by variants of Michel's proof |Lod99j . 

What remains missing are stronger lower bound results. Tighter lower bounds usually 
lead us into better understanding of the intricacy of the complementation of nondeterministic 
u;-automata, and are the main concern of this paper. Such understanding can suggest 
methods to further optimize the constructions, or to circumvent those difficult cases in 
practice. 

To understand why we have so few strong lower bounds, we observe that at the core of 
almost every known lower bound is Michel's result, which was obtained in the traditional 
way. That is, one first constructs a particular family of automata (A n ) n >i, an d then proves 
that complementing each A n requires a large state blow-up. The A n +i of Michel's automata 
family is depicted in Figure [Q Although each A n +\ has a simple structure, it is not straight- 
forward to see what language it accepts, and nor is it clear at all how we can work with this 
automaton for lower bound. 




l...n, ft l...n,(j l...n,Jt 



Figure 1: Michel's Automata Class 

In many cases, identifying such an automata family is difficult, and is the main obsta- 
cle towards lower bounds. In this paper, we propose a new technique to circumvent this 
difficulty. Namely, we suggest first considering the family of full automata in lower bound 
analysis, and later reducing the size of the large alphabet via alphabet substitutions. A 
simple demonstration of such technique is presented in Section O 

With the help of full automata, we tighten the state complexity BC(n) of Buchi com- 
plementation from (0.36ra) n < BC(n) < (0.97n) n to (0.76ra) n < BC(n) < (0.97ra) n . Sur- 
prisingly, this (0.76n) n lower bound also holds for every complementation or determinization 
transformation concerning Buchi, generalized Biichi, Rabin, Streett, Muller, and parity au- 
tomata. As to the complementation of generalized Biichi automata, we prove an (Q(nk)) n 
lower bound, matching with the (0(nk)) n upper bound in |KV05bj . This lower bound also 
holds for the complementation of Streett automata and the determinization of generalized 

^In contrast, for the complementation of nondeterministic finite automata over finite words, the 2 n blow- 
up of the subset construction [RS59J was justified by a tight lower bound [SS78], which works even if the 
alphabet concerned is binary |Jir05| . 



LOWER BOUNDS FOR COMPLEMENTATION OF u> -AUTOMATA 



3 



Buchi automata into Rabin automata. A summary of our lower bounds is given in Section 
El 

Full Automata and Sakoda and Sipser's Languages. It turns out that the notion 
of full automata is similar to Sakoda and Sipser's languages in [SS78]. Their language B n 
actually corresponds to the A-graphs of the words accepted by some full automaton. Also as 
pointed to us by Christos A. Kapoutsis, the technique of alphabet substitution was somewhat 
implicit in Sakoda and Sipser's paper (but presented in a somewhat obscure way, refer to 
the paragraph before their Theorem 4.3.2). So the full automata technique is more like a 
new treatment of some techniques in the Sakoda and Sipser's paper, rather than a totally 
new invention. Compared to Sakoda and Sipser's languages, the notion of full automata 
enjoys a simple definition and is very handy to use. It is also more readily to be extended 
to other kinds of automata like alternating automata. 

For unclear reasons, Sakoda and Sipser's languages were rarely applied to fields other 
than 2-way automata after their paper. We hope that our treatment will make a clear 
exposition of the techniques and demonstrate their usefulness in problems on automata over 
one-way inputs as well. 

2. Basic Definitions 

A (nondeterministic) automaton is a tuple A = (E,S,I,A,*) with alphabet S, finite 
state set S, initial state set 7 C S } transition relation ACSxSxS and * some extra 
components. Particularly A is deterministic if |7| = 1 and for all p G S and a G S, 
\{q£S\ (p,a,q) G A}| < 1. 

For a word w = a(0)a(l) . . . a(l — 1) G X* with length(w) = I > 0, a finite run of A from 

state p to q over w is a finite state sequence p = p(0)p(l) . . . p(l) G S* such that p(0) = p, 

p(l) = q and (p(i), a(i), p(i + 1)) G A for all < i < I. We say that p visits a state set T if 

p(i) G T for some < % < I. We write p —> q if a finite run from p to q over w exists, and 

p — > q if in addition the run visits T. 
T 

A {Nondeterministic) Finite Word Automaton (NFW for short) is an automaton A = 
(E, S, 7, A, F) with final state set F C S. A finite word w is accepted by A if there is a 
finite run over w from an initial state to a final state. The language accepted by A, denoted 
by C(A), is the set of words accepted by A, and its complement E*\£(A) is denoted by 
C C {A). 

For an w-word a = a(0)a(l) • • • G i.e., an infinite sequence of letters in S, a (infinite) 
run of A over a is an infinite state sequence p = p(Q)p(l) • ■ ■ G S u such that p(0) G 7 and 
(p(i),a(i),p(i + 1)) G A for all i > 0. We let Occ(p) = {q G S \ p(i) = q for some i G N}, 
I n f{p) = {Q ^ S \ p(i) = q for infinitely many i G N}, and write , Z2] to denote the infix 
p(h)p(h + l)...p(h) of p. 

An (nondeterministic) co-automaton is an automaton A = (S, S, I , A, Acc) with accep- 
tance condition Acc, which is used to decide if a run p of A is successful. There are many 
types of w-automata considered in the literature |Tho9Q| . Here we consider six of the most 
common types: 

• Buchi automaton, where Acc = F C S is a final state set, and p is successful if 
Inf(p) n F ^ 0. 
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• generalized Biichi automaton, where Acc = {F\, . . . ,F/,} is a list of final state sets, 
and p is successful if Inf(p) n Fi ^ for all 1 < i < k. 

• Rabin automaton, where Acc = {{G±, B\), . . . , (Gk, -Bfc)} is a list of pairs of state 
sets, and p is successful if for some 1 < i < k, Inf(p) n G{ ^ and Inf(p) n Bi = 0. 

• Streett automaton, where Acc = {(G\,Bi), . . . , (Gk,Bk)} is a list of pairs of state 
sets, and p is successful if for all 1 < i < k, if Inf(p) (~l Bi ^ 0, then Inf(p) n G t ^ 0. 

• Muller automaton, where Acc = T C Vowerset(S) is a set of state sets, and /? is 
successful if Inf(p) G J 7 . 

• parity automaton, where Acc is a mapping c : S" — > {0. . . Z}, and /) is successful if 
min{c(g , )|g i £ Inf(p)} is even. 

An u-word a is accepted by A if it has a successful run. The oo-language accepted by A, 
denoted by £(A), is the set of u- words accepted by A, and its complement S aJ \£(A) is 
denoted by C C (A). The number k, if defined, is called the index of A. 

We refer to the above six types of a;-automata as the common types. Following the 
convention in |KV05aj . we will use acronyms like NBW, NGBW, NRW etc. to refer to 
Nondeterministic Buchi/generalized Buchi/Rabin/etc. Word automata. Two simple facts 
about these common types of w-automata are useful for us: 

fAct 2.1. [LodO!)]^,) For every NBW A and every common type T, there exists an T 
automaton A' with the same number of states such that A' is equivalent to A. 

(2) For every deterministic co-automaton A of a common type T which is not Biichi 
nor generalized Biichi, there exists a deterministic uj-automaton A' of a common type (not 
necessarily also T) with the same number of states (and index, if applicable) such that A! 
complements A. 

To visualize the behavior of automata over input words, we introduce the notion of A- 
graphs. If A = (E, S, I, A, *) is an automaton, then for a finite word w = a(0)a(l) . . . a(l — 
1) G S* of length I, or an w-word w = a(0)o(l) • • • G S w of length / = oo, the A-graph of w 
under A is the directed graph Q$ = (V^f , E^) with vertex set = {(p, i) \ p G S,0 < i < 
l,i £ N} and edge set E~- defined as: for all p,q G S and < i < I, ((p, i), (q, i + 1)) G E^ 
iff (p, a(i),q) G A. For a subset T of S, we say that a vertex (p, i) is a T- vertex if p G T. By 
definition p q iff there is a path (in the directed sense) in Q\~ from (p, 0) to (q, length(w)) 
and p q if furthermore the path visits some T- vertex. 

Finally we define the state complexity^ functions. Assume that T is either NFW or 
some common type of w-automata. Then for a T automaton A, C-r(A) is defined as the 
minimum number of states of a T automaton that complements A, i.e., accepts C C (A). For 
n > 1, Cr(n) is the maximum of CV(A) over all T automata with n states. If indices are 
defined for T , then Cr(n, k) is the maximum of Cr(A) over all T automata with n states 
and index k. 



In some literature, instead of merely counting the number of states, sizes of transition relations etc. are 
also taken into account to better measure the sizes of automata. Here we prefer state complexity because it 
is a measure easier to study, and its lower bound results usually imply lower bounds on "size" complexity, if 
the automata witnessing the lower bound are over a not too large alphabet. 
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3. The Full Automata Technique 

In the recently emerging area of state complexity (see [Yu05j for a survey) or in the 
theory of w-automata, we often concern proving theorems of such flavor: 

Theorem 3.1. |Jir05| For each n > 1, there exists an NFW A n with n states over {a,b} 
such that C^Fwi-An) > 2 n 



In other words, we want to prove a lower bound for the state complexity of a transfor- 
mation (NFW complementation in this case, can be determinization etc.), and furthermore, 
we hope that the automata family witnessing the lower bound ((«4 n )n>i in this case) is 
over a fixed small alphabet. Such claims are usually difficult to prove. The apparently easy 
Theorem 13. II was not proved until 2005 by a very technical proof in |Jir05j^, after the efforts 



in |SS78llB~ir93UHK0"2] . To understand the difficulty involved, we first review the traditional 
approach people attempt at such results: 

Step I: Identify an automata family (-4 n )n>i with each A n having n states. 

Step II: Prove that to transform each A n needs a large state blow-up. 
Almost every known lower bound was obtained in this way, including Theorem 13.11 and 
the aforementioned Michel's lower bound. In such an approach, Step I is well-known to be 
difficult. Identifying the suitable family (A n ) n >i requires both ingenuity and luck. Even 
worse, most automata families that people try are natural ones with simple structures, while 
the ones witnessing the desired lower bound could be highly unnatural and complex. Finding 
the right family (A n )n>i seems to be a major obstacle towards lower bound results. 
Now we introduce the notion of full automata to circumvent this obstacle. 

Definition 3.2. Given state set S, initial state set I, and extra components *, &full automa- 
ton A = (E, S, I, A, *) is an automaton with alphabet E = Vowerset{S x S) and transition 
relation A defined as: for all p,q £ S and a S E, (p, a, q) € A iff (p, q) € a. 

By definition, the alphabet contains every binary relation over S, and therefore is of a 
big size of 2l 5 ' . Due to such rich alphabets, every automaton has some embedding in a full 
automaton with the same number of states. It is then not difficult to see that transforming 
an automaton can be reduced to transforming a full automaton, and full automata are the 
most difficult automata to transform. 

To be specific, if we consider NFW complementation, then: 

Theorem 3.3. For all n > 1, Cnfw(^) = Cnfw(-4) for some full NFW A with n states. 

The theorem follows from the following lemma. 

Lemma 3.4. If A\ is an NFW with n states, then there is a full NFW A2 with n states 
such that Cnfw("4-2) > Cnfw(*4i)- 

Proof. By definition of Cnfw, it suffices to show that for some full NFW A2 with n states, 
if there is an NFW CA2 that complements A2, then there is an NFW CA\ complementing 
A\ with the same number of states as CA2. 

Let Ai = (Ei, Si,h, Ax, Fx), and consider the full NFW A 2 = (E 2 , Si, h, A 2 , Fx) with 
respect to Si, Ix and Fx. For each ax € Ei, define letter Ai(ai) in E 2 = 7-* (Si x Si) as: 
ipiiQl) ^ Ai(ai) iff (pi,ai,qx) € Ai, for all px,qi £ Si. By definition of full automata, 



The result is actually slightly stronger in that his An has only one initial state. (In some literature 
NFWs are not allowed to have multiple initial states.) 
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{pi,a 2 ,qi} G A 2 iff (pi,gi) G 02, for all Pi,qi G Si, a 2 € S 2 . So we have (pi,ai,gi) € 
Ai iff (pi, Ai(ai), gi) € A 2 , for all a± G Si,pi,gi G Si. For an arbitrary word a = 
o(0)a(l)...a(Z-l) G £*, consider word a' = Ai(a(0))Ai(o(l)) . . . Ai(o(i - 1)) G S 2 . Then 
every state sequence p\ = pi(0)pi(l) . . . pi(l) € S* is a run of A\ over a iff p\ is a run of A 2 
over a'. Since .4.1 and A 2 share the same initial and final state sets, p\ is successful iff p 2 is 
successful. So a G C(Ai) iff a' G £(.A 2 ). 

Let CA 2 = {T, 2 ,ScJc,^C,Fc) be an NFW that complements £{A 2 ). So a' G £(.A 2 ) 
iff a' <£ £{CA 2 ). Define 04i to be the NFW (Si, S c , la, A c , F c )> where A' c is defined as 
(p 2 ,ai,q 2 ) G A' c iff (p 2 , Ai(oi), q 2 ) G Ac, for all p 2 ,q 2 G Sc and ai G Si- Similarly every 
state sequence pc = pc(0)pcO-) ■ ■ ■ Pci}) G S c is a successful run of CA 2 over a' iff pc is a 
successful run of CA\ over a. So a' G C(CA 2 ) iff a G £(£Ai). 

Now for every a G S*, a G £(.Ai) iff a ^ £(C4.i). Therefore CA± with the same number 
of states as CA 2 complements Ai as required. □ 

Theorem 13.31 implies that to prove a lower bound for NFW complementation (without 
taking the size of the alphabet into account), we can simply set (A n ) n >i to be some family 
of full NFWs in Step I. Similarly, the same applies to NBW complementation: 

Theorem 3.5. For all n>l, Cnbw(^) = Cnbw(-4) for some full NBW A with n states. 

Now we apply full automata to obtain a simple proof of Theorem 13.11 

Proof, (of Theorem 13 . 1 [) We first prove a 2 n lower bound for Cnfw(^)- For each n > 1, 
let TAn = (S n , S n , I n , A n , F n ) be the full NFW with S n = I n = F n = {s , ... , s n _i}. It 
suffices to prove that C^FwiFAn) > 2 n . 

For each subset T C S n , let Id(T) denote the letter {{q, q) | q G T} and let ut = Id(T), 



vt = Id{S n \T). Figure 2(a) depicts one example of ut^t's A-graph. Since all states in 
TAn are both initial and final, a word w of length I is accepted by TA n iff there is a path 
from an (sj, 0) vertex to an (sj,l) vertex in the A-graph of w under TA n . In particular 
utvt is not accepted by TAn. Suppose that some NFW CA complements TA n . So for each 
T C S n , there is a state qx of CA such that qi — > qx and qx — * qF for some initial state qi 
and final state g> of CA. If we prove that qxi 7^ 5t 2 whenever T\ ^ T 2 , then CA has at least 
2™ states as required. Suppose by contradiction that qr x = qx 2 for some T\ ^ T 2 . W.l.o.g. 

there is a state s of TA n in Ti\T 2 . Then s — > s — > s and hence ut 1 vt 2 G £(^ r ^l n ). On 
the other hand, for some initial state qi and final state <jf of C^l, qi — > q~Ti = <Zt 2 — > qF- 
So ut 1 vt 2 G £(CA), contradiction. 

The above proof is not fully satisfying in that the automata family witnessing the 
lower bound is over an exponentially growing alphabet. To fix a binary alphabet and prove 
Theorem 13.11 we introduce a Step III in which we do "alphabet substitution", as we now 
illustrate. 

We first refine the above proof of C^Fy/(TAn) > 2 n by restricting the number of different 
letters involved. For two words u, v G S*, we say that u is equivalent to v with respect to 
TA n , or simply u ~ v, if for all p,g 6 S n , p — > q iff p — > q. A little thought shows that 
if we substitute each Id(T) letter used in the above proof by some equivalent words, the 
proof still works. First we consider the alphabet {cj}o<i< n with Cj = Id(S n \{si}). Then for 
each T C S n , Id(T) ~ n^^TCj, the concatenation of all cj'a with s« ^ T in lexicographical 



order (any other fixed order will do). This is illustrated in Figure 2(b) Then consider the 



alphabet {a, b} with a = {(sj + i,Sj) |0<i<n — 1} U {(sch Sn-i}} and b = Id(S n \{so}), 
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►•Si 
►•S 2 
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(a) utvt 



(b) c lC3 ~ Id{T) 
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a 
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a 




(c) abaaa ~ ci 




Figure 2: Examples 



then for each < i < n, Ci ~ a l foa n l , as illustrated in Figure 2(c 



. So if we substitute each 
letter Id(T) in the above proof by the equivalent word Il Si ^j'a l ba n ~ t , the proof still works. 

After the above refinement of the proof, the part of TA n related to letters other than 
{a, b} is in fact irrelevant to the proof. So A n = TA n \ {a,b}, the restriction of TAn to 
{a, 5}, or formally the NFW A n = ({a, 5}, S n , I n , A n n (S n x {a, b} x 5 n ), F n ), also satisfies 
that CNFw(-4n) > 2 n , as required (A^ is depicted in 2(d) ). □ 



We call the above technique of setting (^4 n ) ra >i to be a family of full automata and 
adding the step of alphabet substitution the "full automata technique". Setting {A n ) n >i to 
be full automata is crucial here, which in essence delays the trouble of identifying (^4 n )n>i 
to the later analysis of transforming full automata. This makes our life easier because the 
latter is usually playing with words, which is clearly easier than constructing automata, 
especially with the rich alphabet of full automata. As to the step of alphabet substitution, 
our experience is that it could be technical some time, but rarely difficult. 



4. BOchi Complementation 



4.1. Kupferman and Vardi's Construction. We first briefly introduce the state-of-the- 
art construction for Biichi complementation by Kupferman and Vardi in |FKV06j . the idea 
of which is important in our lower bound. Different from |FKV0 6]. we will continue to 
work with our A-graphs rather than introducing the notion of run graphs. For x G N, let 
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[x] denote the set {0, 1, . . . ,x} and let [x] odd and [x] even denote the sets of odd and even 
numbers in [x] respectively. 

Definition 4.1. Given an NBW A = (E, 5, /, A, F) of n states, and an w-word a, a co- 
Biichi ranking (C-Ranking for short) for Q£ (i.e. the A-graph of a under A) is a partial 
function / from V£ to the rank set [2n — 2] such that: 

(i) : For all vertices (q, I) G f((q, I)) is undefined iff there is no path (in the directed 
sense) from some (qi,0) vertex with qi G I to (q, I). 

(ii) : For all vertices (q,l) G V£, if f((q,l)) is odd, then q F. 

(iii) : For all edges ({q,l),(q',l + 1)) 6 E£, if f({q,l)) is defined, then f({q,l)) > 

We say that / is odd if for every path in Q£, there are infinitely many vertices that are 
assigned odd ranks by /. 

Lemma 4.2. |KV01j The uj-word a is not accepted by A iff there is an odd C-ranking for 

Proof. We prove the if direction here to give a sense of the idea of C-ranking. For every 
infinite path from a (qi,0) vertex for some qi £ I, the ranks along the path do not increase 
by (iii) and so will get trapped in some fixed rank from some point on. Since / is odd, this 
fixed rank is odd, and thus by (ii), F- vertices are never visited since then. In other words, 
every run of A over a visits F finitely often and hence a is not accepted by A. □ 

A level ranking for A is a partial function g : S — > [2n — 2] such that if g(q) is odd, 
then q £ F. Each C-ranking can be "sliced" into such level rankings. It was shown in [KV01] 
that existence of an odd C-ranking for Q£ can be decided by an NBW CA which guesses 
an odd C-ranking level by level, and checks the validity in a local manner. By Lemma [4721 
CA complements A. In the construction of CA, distinct sets of states are used to handle 
different level rankings, and the number of such level rankings is the major factor of the 
(6n) n blow-up. 

We say that a level ranking g for A is tight if (i): the maximum rank in the range of g 
is some odd number 2m — 1 in [2n — 2] odd , and (ii): for every j G [2m] odd , there is a state q 
with g(q) = j. In such a case, g is also called a TL(m)-ranking (with 1 < m < n). It was 
further shown in [FKV06] that we can restrict attention to tight level rankings and use less 
states in CA. By a careful numerical analysis [FKV06] , a (0.97) n upper bound was proved 
for the number of states of CA and thus for Buchi complementation. 

4.2. Lower Bound. We turn now to lower bound. By Theorem 13.51 it suffices to consider 
full NBWs. We define TB n for re > 1 to be the full NBW (E n , S n , I n , A„, F n ) with J n = 
{so, . . . , Sn-2}, F n = {sj} and S n = I n U F n . We also use S' n = I n to denote the "main" 
states. 

We first try to construct an w-word a n not accepted by TB n such that a great number of 
tight level rankings would have to be present in every C-ranking for Q^® " ■ Since the number 
of tight level rankings is the major factor of the state blow-up in Kupferman and Vardi's 
construction, this would produce a hard case for the construction. For such purpose, we 



Our definitions of level ranking and tight level ranking here are slightly different from [FKV06| . 
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consider a special class of tight level rankings for TB n , Q-rankings. We say that a TL(to)- 
ranking g for TB n is a Q{m) -ranking if g (q) is defined for each q G S' n and is undefined for 
q = Sf. We start defining our difficult w-word a n by defining its composing segments. 

Lemma 4.3. For every pair of Q -rankings (f,g), there exists a word Wf^ g such that: 



(i) : For all p,q G S' n , p 

(ii) : For all p,q G S' n , p 



W f,9 



"f,9 



q *ff (MP) > fi+l(q) or fi(p) = f l+1 (q) G [2m]° M ). 

q iff fib) > fi+i(q)- 



(iii): For allp,q G S n , if p — q thenp,q £ F n . 

Proof. We first illustrate the construction using a typical example depicted in Fig. [3l As in 
Fig. [3j the vertices of the A-graph of Wf t9 are separated by the wider space below c(f,g) 
into two parts. We say that each (sj, j) vertex in the left part is ranked /(sj) by /, and 
each (si,j) vertex in the right part is ranked g(s{) by g. So when one follows a path from a 
leftmost vertex v\ to a rightmost vertex V2, either one goes to a next vertex with the same 
rank, or one visits a (sf,j) vertex and then goes to a vertex with a rank lower by one. This 
explains the only if direction of (ii). Also note that v\ and v% cannot have the same even 
ranks because in the middle of this process, one has to go to a vertex with an odd rank to 
pass c(f,g). So the only if direction in (i) holds too. For the if directions of (i) and (ii), 
suppose one wants to go from a leftmost vertex v\ with rank r to a rightmost vertex V2 with 
rank r' and that either r > r' or r = r' G [2m] odd . Let t be an odd rank such that r >t > r'. 
Then by the construction, one can go from v\ to some vertex with rank t in the left part, 
pass through c(f,g) with rank t, and then continue to go to v-i in the right part. Note that 
in the process, if rank ever decreases, then an (sj,j) vertex must have been visited. So the 
if directions of (i) and (ii) hold as well. Condition (iii) is obviously true. 

s d(f, 3, 2) d(/,_2, 1) c_(/, g) d(g, 3, 2) d(g, 2, 1) d(g, 1,0)^ , ) g 




Figure 3: A-graph of Wf t , 



For later purposes, we explicitly present our construction for Wf >g . For a Q(m)-ranking 
h, we define the state sets Ranker) = {q G S' n \ r = h(q)} for r G [2m] and Oddh to be the 
union of i?anfc/j(r)'s with r G [2m] orfd . Also for each T C S' n , define letters in E n as Id(T) = 
{{q,q)\qe T}, TtoF(T) = Id(S' n ) U {{q, Sf) \ q G T}, FtoT(T) = Id(S' n ) U{(s f , q) \ q G T} 
and c(f,g) = {(p,q) | f(p) = g(q) G [2m] odd , p, q G S' n }. For a Q(m)-ranking h and 
r,r' G [2m], we write d(h,r,r') to denote the word TtoF (Ranker)) ■ FtoT(Rankh(r')). 
Then if r\,r2 ■ ■ ■ ,Tk are the ranks in [2m] that are images of h in descending order, we let 
Uh = d(h, n, -d(h, r2,r%) d(h, r^-i, r^). Finally, Wf ig is defined to be Uf-c(f, g)-u g .\3 
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Lemma 4.4. Let fo, fi,---,fi be a list of Q(m)-rankings with I > 0, and let w be the word 
w h,h w h,h ■ --Wfi-ifr Also le,tp,qe S' n , then: 

(i) V HP) > fl(q) or Up) = Mq) G [2m]° dd , then p ^ q. 

(H) If Mp) > fife), then p q. 

Proof. If I = 1, then w = Wf Q j 1 , and the properties follow from Theorem 14.31 trivially. So 
we assume that I > 1. Let t be an odd rank such that fo(p) > t > fi(q). By definition 
of <5(m)-ranking, there exists a state sequence q±,q2, ■ ■ ■ ,qi-i such that fife) = t for all 

1 < % < I — 1 , So qi for all 1 < i < I — 1. Also because fo(p) > t > fi(q), we 

have p -^i 1 q\ and qi-i f ^—^ 11 q. Concatenate these together, we have p —> q, and (i) is 
satisfied. If fo(p) > fi(q), then either fo{p) > t or t > fife), and hence either p — ^ q\ or 

F n 

qi-i } ^—^ tl q. So p g, and (ii) is satisfied. □ 

F n Fn 

Let L(n,m) be the number of different Q(m)-rankings and let L(n) be max L(n,m). 

l<m<n 

From now on we fix m such that L(n) = L(n, m) and may simply write L for L(n). Clearly 
there exists an infinite looping enumeration /o,/i, • • • of Q(m) -rankings such that /, ^ fj 
for all i ^= j,0 < i,j < L, and fa = fjL+i for all i,j > 0. Our "difficult" w-word a n is then 
the w-word wqW\ . . . where Wi = Wf u f i+1 for all i > 0. 

Lemma 4.5. T/ie u)-word a n is not in LiF&n). 

Proof. If there is a successful run p of TB n over a n , then there is an infinite state sequence 
qoqi ■ ■ ■ E S% such that % — ^ (ft+i for alH > and % — ^ for infinitely many ieN. So 

by the construction of W4 = Wf u f i+1 , fife) > fi+i(qi+i) for alH > and fife) > f i+1 fe + i) 
for infinitely many i G N. This is impossible since fo(qo) is finite. □ 

Recall that Kupferman and Vardi's construction uses distinct state sets to handle differ- 
ent TL(m)-rankings. It turns out that if a complement automaton of TB n does not have as 
many states as Q(m)-rankings, it would be "confused" by a n together with another complex 
a;- word a' derived from a n . 

Lemma 4.6. For each n > 1 and each u- automaton CA with less than L states, if p is a 
run ofCA over a n ^ C{J-B n ), then there is a run p of CA over some uj-word a' € C{J-B n ) 
with Occ(p') = Occ(p) and Inf(p') = Inf(p). 

Proof. Suppose that CA = (E n , S, I, A, Acc) is an w-automaton with less than L states and 
p = p(0)p(l) ■ ■ ■ G S u is a run of CA over a n . Let ko, k\, . . . be a number sequence such 
that ko = 0, ki + i — ki = length(wi) for all i > 0. So the fcj's mark the positions where the 
Wi's concatenate. Therefore p{ki) — ^> piki+x) for all i > 0. Define for each < i < L the 
nonempty set: 

Qi = {<1 G S I p{kjL + i) = q for infinitely many j G N}. 

Since C^4 has less than L states, there exists some state q in Qi D Qj for some i 7^ j, < 
i,j < L. In particular one has, by definition, fi 7^ /j. W.l.o.g. there is a q G with 
> fj(q)- By definitions of Qi and Occ(p), there is a ii G N sufficiently large such that 
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p(kt 1 L+i) = <L every state in Occ(p) occurs in p[0, k tl L+i], and that p(t') € Inf(p) for all 
t 1 > k tl L+i- By definitions of Inf(p) and Qj, there is a sufficiently large i 2 > t\ such that 
p(k t2 L+j) = q and every state in Inf(p) occurs in p[k tl L+i, h 2 L+j]- Let u = w . . . w tl L+i-i 
and v = w^L+i ■ ■ ■ wt 2 L+j-i- Finally let a' be uv 1 ^ . 

Let qi G S' n be such that fo(qi) = 2m - 1 > /*(<?) = f tl L+i(q)- By Lemma[4~H 9/ g. 

Similarly, since f tl L+i{q) = fi(q) > fj(q) = ft 2 L+j(q), by Lemma EES we have q q. 

F„ 

Together we have (// — ► </ — > q — ► (? . . . and a' is accepted by ^"^n. 

Finally, note that p' = p[0, k^L+i] • {p[kt x L+i + 1, ^L+j})^ is a run over a', and we have 
guaranteed that Occ(p') = Occ(p) and Inf(p') = Inf(p) as required. □ 

Theorem 4.7. For every n > I, L(n) < C^BwifBu) < CnbwW, where L(n) = G((0.76n) n ).| 

Proof. By Lemma l4~6l every NBW that complements TB n must have at least L(n) states, 
otherwise both a n and a' n would be accepted by TB n , leading to contradiction. By a numer- 
ical analysis of L(n) very similar to the one in |FKV06| . we have that L(n) = 0((O.76n) n ). 
For completeness, we present the detail of the analysis in appendix. □ 



4.3. Alphabet. Following the proof of Theorem 14.71 one constructs full NBWs witnessing 
the lower bound over a very large alphabet, which we rarely consider in practice. In this 
subsection, we show that by using alphabet substitutions like in the proof of Theorem 13.11 
the NBWs witnessing the lower bound can be also over a fixed alphabet. 

We say two words u and v from S* are equivalent with respect to TB n , or simply umv, 
if for all p,q & S' n : (i) p — —> q iff p — —> q, and, (ii) p — —> q iff p — — > q. Then if one replaces 

F n F„ 

each letter involved in the lower bound proof by an equivalent word over some alphabet T, 
one shows that TB n \ T also witnesses the same L{n) lower bound. 

Lemma 4.8. There is an alphabet T of size 7 such that for each pair (f,g) of Q(m) -rankings 
for TB n , there is a word in T* equivalent to Wf t9 . 

Proof. Let T be the alphabet containing the following 7 letters: 

• rotate = {(s i+1 ,Si) \ < i < n - 2} U {(s , s„_ 2 ), («/, s/)}, 

• clearO = Id(S n \{so}), 

• swapOl = (Id(S' n ) U {(s , si), (si, s )})\{(s , s ), (si, si}}, 

• copyOl = Id(S' n ) U {(si, s }}, 

• OtoF = Id(S n )U{(s ,s f )}, 

• FtoO = Id(S n )U{(s f ,s )}, 

• clear F = Id{S' n ). 

Only three types of letters are relevant in the proof of Theorem 14.71 TtoF(T), FtoT(T) and 
c(f,g). For each T C S' n , one can verify that: 

• TtoF(T) clear F ■ (rotate 1 ■ OtoF ■ rotate™' 1 - 1 ) . 

Si£T 

• FtoT(T) k, Y\ {rotate 1 ■ FtoO ■ rotate 11 - 1 ^) ■ clearF. 
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As to c(f,g), the task is a bit more complicated, and let us view it in a different way. For a 
word w, define set rj = {i\si — * Sj, < i < n — 1} for every < j < n — 1. Clearly for two 
words u and v, the following are equivalent: 

• p — — > q iff p — — > g for all p,q £ S' n . 

• = rj(u) for all < j < n — 1. 

So it is sufficient to find for each c(/, g) a word u> over {rotate, clearO, swapOl, copyOl} such 
that rj(w) = rj(c(f,g)) for all < j < n — 1. 

Appending each letter a to the end of a word w changes the content of the rj(iu)'s. 
Consider these three types of words in V*: 

rotate 1 ■ swapOl ■ rotate n ~ 1 ~ l if i + 1 = j 

{swap i)i+ i ■ swap i+ i )i+ 2 swapj-ij) 



(1) swapij = < 



, , if % + 1 < j 

■(swapj-2,j-i ■ swapj-zj-2 swap i%i+ \) 



(2) copy,. 



1,3 



swapj t i if i > j 

the empty word if i = j 

swapOl ■ copyOl ■ swapOl if i = 1 and j = 

swapo ; i ■ swapij ■ copyOl ■ swapij ■ swapo^ otherwise 
(3) cleari = swapo^ ■ clearO ■ swapo^ 
One can verify that appending a swapij to w exchanges the content of ri{w) and rj(w), 
appending a copyij sets ri(w) to be ri(w) Urj(w), and appending a clear \ empties ri(w). 
Obviously these three operations allow one to reach arbitrary (ri(u>))o<i<n-i configurations, 
including (n(c(/, g)))o<i <n -i, as needed. □ 

So B n = TB n \ T, the restriction of TB n to the alphabet T, satisfies that C^Bw(^n) > 
L(n), and we have: 

Theorem 4.9. For each n > 1, there exists an NBW B n with n states over a seven letters 
alphabet such that L(n) < CNBw(^n)- 



4.4. Other Transformations. Surprisingly, our lower bound on Biichi complementation 
extends to almost every complementation or determinization transformation of nondeter- 
ministic w-automata, via a reduction making use of Lemma 14.61 

Theorem 4.10. For each n > 1 and each common type T\ of nondeterministic co-automata, 
there exists a T\ automaton A n with n states over a fixed alphabet such that: 

(i) : For each common type T2, every T2 automaton that complements C(A n ) has at 
least L(n) states. 

(ii) : For each common type T2 that is not Biichi nor generalized Biich^, every deter- 
ministic T2 automaton that accepts C{A n ) has at least L(n) states. 

Proof. For each common type 71, by Fact 12.11 there is a T\ automaton A n equivalent to 
NBW TB n with also n states [L6d99]. (i) Suppose that an automaton CA of a common 
type accepts £ c (A n )= L c (TBn). Since acceptance of w-automata of a common type only 
depends on the Inf set of a run, the claim can be obtained by applying Lemma 14.61 (ii) 
If some deterministic T2 automaton with less than L(n) states accepts C(A n ), and T2 is 
not Biichi or generalized Biichi, then by Fact 12.11 there is a deterministic w-automaton of 

^Deterministic Biichi or generalized Biichi automata are strictly weaker in expressive power than the 
other common types of w-automata. 
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a common type (not necessarily 7-j) complementing C(A n ) with also less than L(n) states 
|Lod99j . contrary to (i). Finally, the alphabet of A n can be fixed like in the proof of Theorem 

1131 □ 

For the transformations involved in this theorem, less than half already had nontrivial 
lower bounds like n! by Michel's proof or the bunch of proofs by Loding [Lod99| . while 
the others only have trivial or weak 2 n ( n ' ) lower bounds. These bounds are summarized in 
Section [6l 

5. Complementation of Generalized Buchi Automata 

We turn now to NGBW complementation. For NGBWs, state complexity is prefer- 
ably measured in terms of both the number of states n and index k, where index measures 
the size of the acceptance condition. By applying full automata, doing a hard case anal- 
ysis for the construction in |KV05b| based on GC-ranking, and using a generalization of 
Michel's technique, we prove an (Q(nk)) n lower bound, matching with the (0(nk)) n bound 
in |KV05b| . This lower bound also extends to the complementation of Streett automata and 
the determinization of generalized Buchi automata into Rabin automata. 

5.1. Standard Full Generalized Biichi Automata TB n ^. We first define full NGBW 
automata which we will show to witness our desired lower bound. 

We say a generalized Biichi acceptance condition Acc = {F\,F2, . . . ,F k } is minimal, if 
no Fi,Fj pair with i ^ j satisfies that Fi C Fj. Note that if such a pair exists, Fj can 
be removed from Acc without altering the w-language defined. So we will only consider 
minimal acceptance conditions. By the Sperner's theorem in combinatorics |Lub66j . if Acc 
is minimal, then k < (u// 2 j) - 

Definition 5.1. For n > 1 and 1 < k < {u^yz}) > the standard full NGBW TB n>k = 
(S n , S n , I n , A n , Acc n ±) is an NGBW with |5 n | = n, I n = S n and a minimal acceptance 
condition Acc n ^. Let s n f be one of its state. We denote S n \{s n f} as S' n . Acc n ^ is defined 
as an arbitrary fixed set {F±,F2, . . . , F k } C V(S' n ) such that: (i) \Fi\ = [(n — l)/2j for each 
Fi E Acc n: k- (ii) For each q £ S' n , the number of Fj's in Acc Ut k that do not contain q is at 
least [k/2\. 

We must show that there is really such a minimal Acc n ^ satisfying (i) and (ii). First 
let Acc n ^k be a collection of arbitrary k distinct subsets of S' n of [(n — l)/2j states and 
thus (i) is satisfied. Define \q f° r eacn Q € S' n as the number of F^s in Acc n ^ that contain 

k 

q. By double counting, Yl Xq = Yl So if \x P — Xq\ ^ 1 f° r an P,Q ^ S' n , then for 

q&S' n i=l 

all q € S' n , Xq ^ \ i k ^ n ~^/ 2 ^ "j < \k/2] and (ii) is also satisfied. Suppose Xp ~ Xq > 1 f° r 
some p,q £ S' n . A little thought shows that there is an Fi G Acc n ^ such that p S Fi and 
(Fi\{p}) U {q} £ Acc U)k . Replace Fi in Acc njk by (Fi\{p}) U {q} and we make \x P - Xq\ 
strictly smaller. Repeat this till \x P — Xq\ ^ 1 f° r an P,Q & S' n . Then condition (ii) is also 
satisfied. 
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5.2. A Generalization of Michel's Technique. We generalize the technique used in 
Michel's proof for Biichi complementation |Mic88j so that a tighter analysis of NGBW 
complementation becomes possible. 

Definition 5.2. A generalized co-Biichi segment (GC-segment for short) w of an NGBW 
B is a word such that w u ^ C(B). Two GC-segments Wi,w 2 of B conflict if all w-words in 
the form w^ (w^w^ 2 )^ , h > are in C(B). A set W of GC-segments of B is a conflict set 
for B if every two distinct GC-segments in W conflict. 

Lemma 5.3. If W is a conflict set for NGBW B, then C NG bw(#) > \W\. 

Proof. Suppose that some NGBW CB = (S, S, I, A, F) complements B, then for each GC- 
segment w of B in W, CB accepts . For every two distinct GC-segments wx,w 2 E W, 
let l\ = length(w\), I2 = length(w2), and let p(0)p(l) . . . and p'(0)p'(l) . . . be CB's two 
successful runs over wf and respectively. Define 

Qi = {<? E 5 I p(i • Zi) = g for infinitely many i G N} 

and 

Q2 = {<} E S \ p'(i ■ I2) = q for infinitely many i E N}. 

Clearly Qi and Q2 are nonempty. It suffices to show that Q\ n Q2 = 0, since it implies that 
the number of states of CB is no less than the number of GC-segments in W. 

Suppose by contradiction that some q is in Q± C\ Q2. By definition of Q\, there is a 
sufficiently large ko > such that p(koh) = q and for each i > koh, p(i) E Inf(p). So 

p[0,koli] is a finite run over from some initial state qi of CB to g, i.e., qi — -> g. By 
definitions of Qi and Inf(p), there is a sufficiently large fci > such that p((fco + ^1)^1) = q 
and in addition p[A; • l\, (ko + fei)Zx] is a finite run from q to q over u;^ 1 which visits every 
state in Inf(p). Similarly we have that for some k' and Z«2 > 0, p'[k' Q l2,(k + ^2)^2] is a 
finite run from q to q over u;^ 2 which visits exactly every state in Inf(p'). We construct a 
new run as follows: 

Pnew — P 

[0, koh] ■ [p[k h + 1, (ko + h)h] ■ p'\k'oh + 1, (k'o + k 2 )h]f , 

which is a run over a = w^ (w^ 1 w 2 2 ) tJ with Inf(p new ) = Inf(p) Ulnf(p'). As p and p' are 
both successful, p ne iu is also successful by definition of generalized Biichi automata. So a is 
accepted by CB. However, as w\ and W2 conflict, a is accepted by B too, contradiction. □ 

Corollary 5.4. If W is a conflict set for NGBW B, then every NSW (nondeterministic 
Streett automaton) that complements B has at least \W\ states. 

Proof. Streett automata also satisfy that if p and p' are both successful runs, then every 
run p new satisfying Inf(p new ) = Inf(p) U Inf(p') is also successful. So the same proof as 
of Lemma 15.31 applies here. □ 

5.3. A Conflict Set for TB n j~. It remains to define a large conflict set for TB n ^. The 
following concept of pseudo generalized co-Biichi level ranking is adapted from the concept of 
generalized co-Biichi level ranking in the NGBW complementation construction in [KV05b| . 

Definition 5.5. A pseudo generalized co-Biichi level ranking (PGCL-ranking for short) for 
TB n fc is a pair (/, g) such that / is a bijection from 5^ to {1, . . . , n — 1} and g is a function 
from S' n to {1, 2, ... , k} such that each q E S' n is not contained in F g r q y 
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By definition of FB n> k, there are at least [k/2\ choices for the value of g(q) for each 

q G S' n . So there are at least (n — 1)! x ([k/2\) n ~ 1 many different PGCL-rankings, which is 

{yt(nk)) n by Stirling's formula. 

Let Q be a set of state sets. In the following, we use notations in the form p -^-> q to 

g,\B 

denote that there is a finite run over w from p to q such that the run visits every state set 
F in G, but it does not visit B. Either G or B will be omitted if is empty. In the following, 
we set T= {F 1 ,...,F k }. 

Lemma 5.6. For each PGCL-ranking (f,g), there exists a word segf >g with the properties 
that for all p,q G S' n : 

(i): If p = q, i-e., f(p) = f(q), then there is a unique finite run of TB n ^ over seg^ g 



from p to q, and it is in the form p 



se 9f,g 



F\F gip) ,\F g(p) 

(ii) : If f(p) > f(q), then there is a unique finite run of TB n ^ over seg^ g from p to q, 
and it is in the form p q. 

(iii) : // f(p) < f(q), then there is no finite run of TB n ^ from p to q over segf tg . 

Proof. For notational convenience, we use notation like L ®P 1 ~tf 2 ' 1 to denote letter 

{{iii} I 1 £ S' n } u {{pi,P2)}\{(P3,Pa}, (pbiPb)}- We also define a choice function c(i,p) for 
each i G {1, . . . , k} and state p G S' n with g(p) ^ i such that c(i,p) equals to some arbitrary 
fixed element in Fi\F g ( p y 

For each r G {1, . . . , n — 1}, let p G S' n be such that f{p) = r, and define: 



U r = 



n 



s=c(i,p) 



3p 
3s - 



s, Qp 

Snf, Qs 



P, 

s 



)S n / -> s, 



Qp 
Qs - 



P, 
s 



(Recall that UU means the concatenation of all words in U in lexicographical order.) Then 
for each q G S' n , there is a unique finite run over u r from q to q, and it is in the form 
q if p = q, or q ——> q otherwise. 



F\F g(p) ,lF g(p) 



IF, 



g(p) 



For each r = {2, 3, . . . , n — 1}, let p,q,s G S' n be such that f(p) = r, f(q) = r — 1 and s 
be an arbitrary state in F g ^y Define: 



3p 
3s - 



s , Qs 

Snf 



)s — > q , Qs 
3s n / -> s 



Then there is a unique finite run over v r from p to g, and it is in the form p q. Also 



9(P) 



for every q' G S^, there is a unique finite run over v r from q' to q' , and it is in the form 



\F, 



1 q ■ 



a(p) 



Finally let segf, g be u n _iu n _iu n _ 2 w n -2 • • • u 2 «i. 

To see that segf ;g satisfies the required properties, first note that for all p G S' n , p 



\F, 



P 



9(P) 



and p p. For property (i), for every p G S' n with /(p) = r, there exists a unique finite 



\F, 



a(p) 



run over segf tg , and it is in the form: 



U n -lV n -l...U r+ lV r+ l 

p >p 



IF 



s(p) 



■^\Fg(p)JF g ( p ) 



V r U r —l...V2Ul 
> P >p, 



IF, 



s(p) 
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that is, p > p as required. For property (ii), for every p,q G S' n with f(p) = r± > 

F\ F g(p), lF g(p) 

r 2 = J(q), let s r G S' n be such that f(s r ) = r for each r\ > r > r 2 . There is a unique finite 
run over segf tg , and it is in the form: 

«n-l"n-l-«r 1 + l''r 1 +l U r± V r± 

p > p — > p - — > s n _i 

F\ F g{p)S F g(p) b g{v) 
U ri -\V ri -l U r2 + iV r , 2 + l U r , 2 ...V2Ul 

► s ri _ 2 . . . s r2+ i > q > q, 

that is, p q as required. Property (iii) is easy to verify. □ 

Remark 5.7. From the proof of the above lemma, it follows that an alphabet of size 
polynomial in n is sufficient to describe {segf >g \f, g are PGCL-rankings}. 

Lemma 5.8. For each PGCL-ranking (f,g) for TB n ^, word segf^ g is a GC-segment of 

Proof. Let I = length(segf :g ), and let p = p(0)p(l) ... be a run of TB n ^ over seg^ in the 

form p(0) — > 9 p(l) — f p(2l) .... Note that by the construction of segf^ g , p(i ■ I) G S' n and 
f(p(i ■ /)) is defined for all i > 0. Then by property (iii), /(p(0)) > f(fi(l)) > /(p(2/)) >... 
and then for some t G N, f(p(t' ■ /)) = f(p(t ■ I)) for all t' > t, that is p(t' ■ I) = p(t ■ I) 
for all t' > t since / is a bijection. Let j = g(p(t • /)). By property (i), Fj is not visited 
in p[t' • I, (t' + 1) • /] for all t' > t. So Inf(p) (1 Fj = and hence seg^ is not accepted by 
FB n , k . ' □ 

Lemma 5.9. The set W = {segf >g \ (f,g) is a PGCL-ranking for TB n ^\ is a conflict set 
of size (fl(nk)) n for TB n ^. 

Proof. Suppose (fi,gi) and (f 2 ,g 2 ) are two distinct PGCL-rankings. Let w\ = segf lt91 and 
w 2 = segf 2t92 . There are two cases. 

Case: I: f\ and f 2 are two different bijections. So there exist p,q G S' n such that 

flip) > hil) and f 2 (p) < f 2 (q). By property (i), p p, q q and so p J — ► 

^m— 1 

p, q q for all m > 0. By property (ii), p — l -* q and q — ^ p. So for all m > 0, 

p q and g — p. Now for every w-word a in the form ti^" (tf 1 1 w 2 2 ) W ) &i > 0) we 

fen fci fco fci fco 

u^" m^ 1 ui 2 U)]^ w 2 

construct a successful run over a asp — ► p — > q — > p — > q — ► p . . . . So a is 

T T T T 

accepted by TB n ^ and w\ conflicts with w 2 . 
Case: II: f\ = f 2 but g\±g 2 - Let p G S' n be such that gi(p) / g 2 (p). By property (i), 

p — > p and p — > p. As g\ (p) ^ g 2 (p), p — - — p for every 

• 7r \- F Sl(p). ! - F gi(p) :F \ i; s2(p)' ! - F 92(p) T 

k±,k 2 > 0. Now for every w-word a in the form w\° (w^ 1 w^ 2 ) w , fcj > 0, we construct 

fcn fcl fco fci fco 

a successful run over a as p — > p ► p > p . . . . So a is accepted by 

TBnfc and w\ conflicts with w 2 . 

Finally, the size of W is just the number of different PGCL-rankings for TB n ,k, which is 
(n(nk)) n . □ 
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5.4. Results. 

Theorem 5.10. For n > 1 and 1 < k <(|( n l7)/ 2 j)> Cngbw(™, k) = {Vt(nk)) n . 

Proof. The theorem follows from Lemma 15.31 and Lemma 15.91 directly. □ 

This matches neatljH with the (0(nk)) n construction in |KV05bj . and thus settles the 
state complexity of NGBW complementation. Like Michel's result, this lower bound can 
be extended to NSW complementation and the determinization of NGBW into DRW (state 
complexity denoted by -DNGBW^DRw( n > k)): 

Theorem 5.11. For all n > 1 and 1 < k < ( i( n 1^y 2 i ) , Cnsw(»i, fc) = (Q(nk)) n and 
D NGB w-*DRw(n,k) = (Q(nk)) n . 

Proof. By Fact 12. II there is an NSW S n ^ equivalent to each TB n ^ with the same number of 
states and the same index. By Corollary 15.41 and Lemma l5l?l every NSW that complements 
TB n fr has (Vt(nk)) n states. So Cnsw («5 n ,fe) = (£l{nk)) n and C^sw{ n ,k) = (Vt{nk)) n . 

Suppose by contradiction that 1Z is a DRW with less than \W\ states that accepts 
djFBn^k), then by Fact 12.11 there is a DSW S complementing TB n ^ with the same number 
of states as 1Z, contrary to Corollary I5.4L So -DNGBW^DRw( n > = (Q(nk)) n . □ 

Remark 5.12. For the above lower bound, by Remark 15.71 the alphabet involved in the 
proof is of a size polynomial in n. It seems difficult to fix a constant alphabet, but we 
conjecture this to be possible if we aim at a weaker bound like 2 Q ( nlognk \ 



6. Summary 

In the following table, we briefly summarize our lower bounds. Here "Any" means 
any common type of nondeterministic w-automata (and the two Any's can be different), 
"co." means complementation and "det." means determinization. "L.B." /"U.B." stands for 
lower/upper bound. Weak 2 n ( n ) lower bounds are considered trivial. 



# 


Transformation 


Previous L.B. 


Our L.B. 


Known U.B. 


1 


NBW NBW 


n((0.36n) n ) [Mic88] 


ft((0.76n) n ) 


O((0.97n) n ) [FKV06] 


2 


A co. or det. A 
Any — ► Any 


trivial or n\ [L6d99] 


2p{n log n) 




3 


NBW DMW 


trivial^ 


2$i(n log n) 


^nlogn) | Saf89 j 


4 


NRW NRW 


trivial^ 


2 p.(n log n) 


2 0(nk log n) ^V05aJ 


5 


NGBW NGBW 


fi((n/e) n ) [Mic88] 


(n(nk)) n 


(0(nk)) n [KV05b] 


6 


NSW NSW 


n((n/e) n ) [Lod99] 


(n(nk)) n 


2 0(nk\ og (nk)) ^ KV 05aJ 


7 


NGBW DRW 


n((n/e) n ) [Lodaal 


(n(nk)) n 


2 0(nk\og(nk)) |g af89 j 



In particular, lower bound #2 implies that the 2 n( - nlogn ' ) blow-up is inherent in the 
complementation and determinization of nondeterministic w-automata, corresponding to 
the 2 n blow-up of finite automata. The special case #3 justifies that Safra's construction is 
optimal in state complexity for the determinization of Biichi automata into Muller automata. 

^The gap hidden in the notation (Q(nk)) n can be at most c n for some c, while the gap hidden in the 
more widely used notation 2 e ( nlogn,s ) can be as large as (nk) n . 
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We single out this result because this determinization construction is touched in almost every 
introductory material on w-automata, and its optimality problem was explicitly left open in 
|Lod99j . 

For many of these transformations, it is still interesting to try to narrow the complexity 
gap, and here we discuss three of them. First, the complexity gap of Biichi complementation, 
although significantly narrowed, is still exponential. By analyzing the difference between 
the lower and upper bounds, one can find that the gap is mainly caused by the use of the 
state component O in |FKV06| to maintain the states along paths that have not visited 
an odd vertex since the last time O has been empty. So we should investigate how many 
states are really necessary for such a purpose. Second, for Streett complementation, the gap 
is still quite large. We feel that efforts should be first taken to optimize the construction 
in |KV05a| . Third, it is interesting to see if an Q(n n ) or similar lower bound exists for 
the determinization of NBWs into Muller or Rabin automata. Such would imply that 
determinization is harder than complementation for w-automata, unlike the case of automata 
over finite words. Of course, one can also work on the reverse direction, trying to design 
ranking based constructions for determinization, which could have good complexity bound 
as well as better applicability to practice. 

Finally, we remark that the full automata technique has been quite essential in obtaining 
our lower bound results. It is also possible to extend the full automata technique to other 
kinds of automata, like alternating automata or tree automata. We hope that the full 
automata technique will stimulate the discovery of new results in automata theory. 
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and the extremely valuable suggestions. I thank Enshao Shen for his kind support and 
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Appendix A. Numerical Analysis of L(ri) 

In this section, we prove that L(n) = 0((O.76n) n ). The analysis is very similar to the 
one in |FKV06j . but we still present it here for completeness. In the following, we write 
f(n) rs g(n) if two functions differ by only a polynomial factor in n. For example, by 
Stirling's formula, n! w (n/e) n . 

Let T(n, m) denote the number of functions from {1 . . . n} onto {1 . . . m}. The following 
estimate of T(n,m) is implicit in Temme [Tem93j: 

Lemma A.l. [Tem93] .For < f3 < 1, let x be the positive real number solving fix = l — e~ x , 
andleta = - mx + /31n(e x - 1) - (1-/3) + (1-/?) ln(l//3- 1). ThenT(n, [[5n\) « (M \J3] n) n , 

where M [f3] = e a ~ p (j^) 1 "^. 

To prove a lower bound for L(n), we first express L(n,m) in the following form: 
Lemma A.2. L(n,m) = J2t=m ( n 7 1 ) T (*> m ) mn ~ 1 ~* . 

Proof. To count the number of different (J(m)-ranking, we fix t, which denotes the number 
of states that have odd ranks. Then there are (™T ) ways to choose which t states have 
odd ranks, and there are T(t, m) ways to assign these t states the m different odd ranks. 
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Moreover, for each of the other n — 1 — t states in S' n , there are m ways to choose which 
even rank it is assigned. □ 

Theorem A. 3. L(n) = n((qn)"), where q = 0.76. 

Proof. By the previous lemma, L{n) = max Y^t=m ("T 1 )-^' m)m n ^ 1 ^ t . Since we do 
not care about polynomial factors, Yl?=m can be replaced by max , and we can replace 

t=m...n— 1 

ml by (m/e) m and ( n ~ ) by ff( - n "^ n _ f as well. Also let 7 = m/n and (5 = t/n, then we 
have: □ 

L(n) » max n n (/3n)" /3n ((l - /^n)-^)™ • (M[ 7 //3]/3n) /3n • ( 7 n) n " 1 - /3n 

0<7</3<l 

» max 7 )n) n , where /i (/?, 7) = (1 - ^^(Mfa/tfiyy 

0<7</3<l 

Computed by the Mathematica software, h(P,i) = 0.7645 when (3 = 0.7236,7 = 0.5744. 
So (0.76n)™ is an asymptotic lower bound for L(n). 
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